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Abstract 

Multiple zeta values have been studied by a wide variety of methods. In this 
article we summarize some of the results about them that can be obtained by an 
algebraic approach. This involves "coding" the multiple zeta values by monomials 
in two noncommuting variables x and y. Multiple zeta values can then be thought of 
as defining a map C : i^*^ ^ R from a graded rational vector space Sj^ generated by 
the "admissible words" of the noncommutative polynomial algebra Q{x, y). Now S)^ 
admits two (commutative) products making a homomorphism-the shuffle product 
and the "harmonic" product. The latter makes S^^ a subalgebra of the algebra 
QSym of quasi-symmetric functions. We also discuss some results about multiple 
zeta values that can be stated in terms of derivations and cyclic derivations of 
Q{x,y), and we define an action of QSym on Q{x,y) that appears useful. Finally, 
we apply the algebraic approach to relations of finite partial sums of multiple zeta 
value series. 

1 Introduction 

The last fifteen years have seen a great deal of work on the multiple zeta values (MZVs) 

where 11,12, ... ,ik are positive integers. The case k = 2 goes back to Euler [HI, and was 
revisited by Nielsen ^] and Tornheim [SH]- The general case was introduced in ^\ and 
These quantities have appeared in a surprising variety of contexts, including knot 
theory [23], quantum field theory and even mirror symmetry pUj . 

Much work on MZVs has focused on discovering and proving identities about them, 
particularly those that express MZVs of "length" {k in equation (1)) greater than one 
in terms of ordinary (length one) zeta values. Even in the length-two case, it appears 



that there are MZVs that are "irreducible" in the sense that they can't be expressed 
(polynomially with rational coefficients) in terms of length one zeta values, e.g., ({Q,2). 
(Of course it isn't known how to prove even that C(3)^/C(2)^ is irrational, so we have 
to say "appears": but everyone since Euler who has looked for some reduction of C(6,2) 
hasn't found one.) 

Many approaches have been used to obtain MZV identities. Analytic techniques are 
emphasized in the surveys jS] and [Sj. In this article we will focus on algebraic techniques. 
It is evident that sums of form (1) constitute an algebra by simple multiplication of series: 
this was the starting point of [TRj, which formalized the "harmonic algebra" of MZVs. 
But, as has become fairly well known by now, there are two distinct algebra structures 
on the set of MZVs, the harmonic (or "stuffie" ) algebra and the shuffie algebra. 

Before proceeding further it is useful to introduce an algebraic notation for MZVs. Se- 
ries of form (1) can be specified by the composition (finite sequence of positive integers) 
{ii,i2, ■ ■ ■ yik)', to this composition we assign the word x'^^~^yx^^~^y ■ ■ ■x^''~^y in noncom- 
muting letters x and y. A series of form (1) converges exactly when ii > 1, i.e., when the 
corresponding word starts with x and ends with y. We call such words "admissible", and 
we can think of ( as assigning a real number to each admissible word. (It is convenient 
to treat the empty word 1 as admissible and set ({1) = 1.) Note that if w is the word 
corresponding to a composition (ii, . . . , 4), the weight ii + ■ ■ ■ + ik is the total degree \w\ 
of w. In this case the length k of the composition is the ^/-degree of w] we denote this by 
i{w). We will find it convenient to call \w\ — i{w) (i.e., the x-degree of w) the colength of 
w, denoted c{w). 

Let be the underlying rational vector space of Q(x,?/), and let 5^° be the subspace 
generated by the admissible words. Then we think of C as a Q-linear map ^ : — > R. 
Now X and y are not admissible, but Sj^ is a noncommutative polynomial algebra on 
the words Vp^q = x^y'^ for p, g > 1 (Of course ( is not a homomorphism for this algebra 
structure). We call the length of a word w G in terms of the Vp_q its height, denoted 
ht(w). For example, ht{xyx'^y'^) = 2. 

With this notation, it is easy to state two identities whose proof motivated much of 
the early work on MZVs, the sum theorem and the duality theorem. (Both appeared 
in J7j as conjectures: the sum theorem was proved by Granville ^3] and independently 
by Zagier; the duality theorem was proved via the iterated integral discussed below-see 
[SH] -unfortunately without any notice of the conjecture!) The sum theorem can be stated 
as 

E c(«^) = c(^) 

w&SjP, \w\=n, £{w)=k 

for n > 2. For the duality theorem, define an antiautomorphism r of the noncommutative 
polynomial ring Q(x, y) by r(x) = y and r(y) = x; note that r is an involution that 
exchanges length and colength, and preserves height. The duality theorem states that 

C(^) = C{t{w)) 

for admissible words w. 
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Another of "early" results on MZVs was the Le-Murakami theorem of [23] . This is 
the identity 



weS^O, \w\=2n, ht(M))=A: i=0 ^ / 

which they proved by examining the Kontsevich integral of the unknot. 

One reason for the efficacy of the "algebraic" notation is apparent-it corresponds to 
the expression of MZVs by iterated integrals, as follows. Let w = aia2---an be the 
factorization of an admissible word into x's and y's. Then it is easy to show that 

^/ N _ /"*" Z"*^ dti dtn-i dtn , . 



JQ 



where 

Mt) 



if Oi = x, 
t, if Oi = y. 



The duality theorem follows immediately from the change of variable (ti, . . . , t„) ^ (1 — 
t„, . . . , 1 — ti) in the iterated integral. In addition, the fact that iterated integrals multiply 
via shuffie product (see the Section 2 below) implies the existence of the shuffie product 
structure on the set of MZVs. 

But the series multiplication (or "stuffle product") can also be formated in terms of the 
algebraic notation; this is the "harmonic algebra" of [18J. The formulation in jTH] led to 
the discovery that the harmonic algebra of MZVs is a subalgebra of the quasi-symmetric 
functions. We discuss this in detail in Section 3. 

Another remarkable success for the algebraic method is the result of J7] that I have 
since (see |2I]) called the derivation theorem. Let D be the derivation of Q(x,?/) with 
D{x) = and D{y) = xy. Then D takes S)^ to itself, as does the derivation tDt. We can 
state the derivation theorem as 

aD{w)) = aD{T{w))) 

for admissible words w. The proof of this in jT7] is an elementary but messy partial- 
fractions argument. It seems to have nothing to do with iterated integrals, but the 
algebraic notation is working some magic here-just compare the formulation above with 
the one given as Theorem 5.1 of jTTj: for any admissible composition {ii, . . . ,ik), 

k ij-'i 

C{ii, ...,ij + l,...,ik)= ^ ^ C(^i, • • • , ij-i, ij - P, P + 1, ij+i, ■■■Jk)- 

j=l l<j<k P=0 

The sum, duality, and derivation theorems are all subsumed in a remarkable identity 
proved in 1999 by Ohno [29jj. It can be stated nicely in the algebraic notation, but to do so 
will require some more machinery: see Section 4 below. More recently, I conjectured, and 
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Ohno proved, a somewhat mysterious "cyclic" analogue of the derivation theorem |21j . 
As with the derivation theorem, the statement in the algebraic notation is very simple, 
but the proof is a tricky partial-fractions argument. We discuss this in Section 5. 

The "magic" of the algebraic notation seems to extend to the finite partial sums of 
the MZVs. Here the harmonic algebra still applies, although the shuffle algebra does not. 
In Section 6 we state some results on finite multiple sums, including some mod p results 
{p a prime). The main result of this section appears to be new. 

2 The Shuffle Algebra 

As above, let ^ be the underlying graded rational vector space of Q(x, y), with x and y 
both given degree 1. We define a multiplication uj on by requiring that it distribute 
over the addition, and that it satisfy the following axioms: 

51. For any word w, 1 U-i w = w U-i 1 = w; 

52. For any words Wi,W2 and a, 6 G {x, y}, 

awi Lu bw2 = a{wi lu bw2) + b{awi lu ^2)- 

Induction on total degree then establishes the following. 

Theorem 2.1. The lu -product is commutative and associative. 

Recall from the previous section that r is the anti-automorphism of Cl{x,y) the ex- 
changes X and y. Then we have the following fact. 

Theorem 2.2. r is an automorphism of (S), lu ). 

Proof. Since evidently r2 = id, it sufflces to show that r is a lu -homomorphism. Using 
the axioms SI, S2 above and induction on IW1W2I, it is straightforward to prove that 

Wia LU = {wi LU W2b)a + [wia lu W2)b 

for any words wi,W2 and letters a, 6. Now suppose inductively that t{wi lu W2) = 
t{wi) lu t{w2) for |wi'u;2| < n, and let wi,W2 be words with |tyiW2| = n. We can as- 
sume both wi and W2 are nonempty; write wi = w[a and W2 = ^36. Then 

t{wi lu ^2) = t{{w[ lu W2)a + {wi lu W2)b) 

= T{a)T{w[ LU W2) + T{b)T{wi LU w'2) 

= r(a)(r(^) lu t{w2)) + r(6)(r(w;i) lu t{w'2)) 
= T{a)r{w[) LU T{b)T{w2) 
= t{wi) lu t{w2). 

□ 
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Now order the words of 9j as follows. For any words 1^1,^2,^3, set W1XW2 < Wiyws; 
and if u, v are words with v nonempty, set m < . A nonempty word w is called Lyndon 
if it is smaller than any of its nontrivial right factors; i.e., w < v whenever w = uv and 
u ^ 1 ^ V. From [3T] we have the following result. 

Theorem 2.3. As a commutative algebra, {Sj, lu ) is freely generated by the Lyndon 
words. 

The link between the shuffle algebra and MZVs is given by the iterated integral rep- 
resentation (2), together with the well-known fact jn2] that iterated integrals multiply by 
shuffle product. We can state this as follows. 

Theorem 2.4. The map ( : (S)^, lu ) R zs a T-equivariant homomorphism. 

The shuffle-product structure has been used to prove some MZV identities. For ex- 
ample, in ^ it is first established that 

n 

r=—n 

in 9), and then ( is applied to get 

n 

r=—n 

Using the known result 

2k 

(for which see the remarks following Theorem 3.5 below), together with some arithmetic, 
one then obtains the result conjectured by Zagier P9j several years earlier: 

c((xV)") = TT^aixyrn- 

2n + 1 

Other shuffle convolutions are used to prove some instances of the "cyclic insertion con- 
jecture" for MZVs in the same paper, and the topic has been revisited in 

3 The Harmonic Algebra and Quasi- Symmetric Func- 
tions 

We can define another commutative multiplication * on ^ by requiring that it distribute 
over the addition and that it satisfy the following axioms: 

HI. For any word w,l*w = w*l=w; 
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H2. For any word w and integer n > 1, 

x"- * w = w * x"^ = wx"'; 

H3. For any words Wi, W2 and integers p,q > 0, 

x^ywi * x'^yw2 = x^y{wi * x'^yw2) + x'^y{x^ywi * W2) + x^^'^^^y{wi * 102). 

Note that axiom (H3) allows the *-product of any pair of words to be computed recursively, 
since each *-product on the right has fewer factors oiy than the ^-product on the left-hand 
side. Induction on ^/-degree establishes the counterpart of Theorem 2.1. 

Theorem 3.1. The ^-product is commutative and associative. 

We refer to ^ together with its commutative multiplication * as the harmonic algebra 
(i^,*). Evidently r is not an automorphism of (^,*). But we do have counterparts of 
Theorems 2.3 and 2.4, which are proved in [T^. 

Theorem 3.2. As a commutative algebra, (^, *) is freely generated by the Lyndon words. 

Theorem 3.3. (^°, *) is a subalgebra of {:^, *), and ( : (^°, *) H is a homomorphism. 

Because the multiplications * and lu are quite different. Theorems 2.4 and 3.3 imply 
that C has a large kernel. For example, since 

xy * xy = 2{xy)'^ + x^y 
xy LU xy = 2{xyY + Ax^y"^ 

we must have 

C{x^y - Ax^y^) = 0. 

In fact, it has been conjectured that all identities of MZVs come from comparing the two 
multiplications. The derivation theorem can be recovered, since 

yujw — y*w = tDt{w) — D{w) 

for w & S)^ (Theorem 4.3 of j^). Zudilin ^IJ states the conjecture as 

kerC = {u Luv — u*v\ue :^^,v G 

for other formulations see JH] and 

Let S^^ be the vector subspace Ql + Sjy of S); it is evidently a subalgebra of {Sj, *). 
In fact, since x is the only Lyndon word ending in x, it is easy to see that Sj^ is the 
subalgebra of (i^, *) generated by the Lyndon words other than x. Note that any word 
w & Sj^ can be written in terms of the elements Zi = x^~^y, and that the ^/-degree ^{w) is 
the length of w when expressed this way. We can rewrite the inductive rule (H3) for the 
^-product as 

ZpWi * ZqW2 = Zp{wi * ZqW2) + Zq{ZpWi * W2) + Zp+q{wi * W2). (4) 



6 



Now for each positive integer n, define a map 0^ : ^ Q[ti, . . . , t„] (where \ti\ = 1 
for all i) as follows. Let 0ri(l) = 1 and 

0n(^n^j2 ' ' ' ^ik) ~ ^n\^n2 ' ' ' ^nl 

n>ni>n2>-->nfe>l 

for words of length k < n, and let = for words of length greater than n; extend 

(f)n linearly to Because the rule (4) corresponds to multiplication of series, 0„ is a 
homomorphism, and (pn is evidently injective through degree n. For each m > n, there is 
a restriction map 

Pm,n ■ Q[ti, 

such that 

PiU) 

The inverse limit 

<P = proj limQ[i(:i, . . . ,t„] 

n 

is the subalgebra of Q[[ti, ^2, • • • ]] consisting of those formal power series of bounded de- 
gree. Since the maps 0„ commute with the restriction maps, they define a homomorphism 
: ^1 ^ <P. 

Inside ^ is the algebra of symmetric functions 

Sym = proj lim Q [t i , . . . , t„] ^" 

n 

and also the algebra of quasi-symmetric functions (first described in [T^). We can define 
the algebra QSym of quasi-symmetric functions as follows. A formal series p G ^ is in 
QSym if the coefficient of t^^^ ■ ■ ■ t^^^ in p is the same as the coefficient of t^^ ' ' '^^1 P 
whenever ii < 12 < ■ ■ ■ < ik and ji < 32 < ■ ■ ■ < jk- Evidently Sym C QSym. A vector 
space basis for QSym is given by the monomial quasi-symmetric functions 

n<*2<---<«fc 

which are indexed by compositions (pi, . . . Since evidently ' ' ' ^ik) — ^{ik,-M)^ 
we have the following result. 

Theorem 3.4. cf) is an isomorphism of Sj^ onto QSym. 

As is well known, the algebra Sym of symmetric functions is generated by the elemen- 
tary symmetric functions Cj, as well as by the power-sum symmetric functions pi (Note 
that we are working over Q). It is easy to see that (f)~^{ei) = z\ and 4>~^{pi) = Zi. Let 
Sym° be the subalgebra of the symmetric functions generated by the power-sum symmetric 
functions Pi with i > 2. Then 0~^(Sym) D = 0~^(Sym°). Since is a homomorphism, 
we have the following result. 



I ^m] Q[^l) ■ ■ ■ ; ^71 



t,-, i < n 



0, i > n. 
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Theorem 3.5. If a E cj) ^{Sym^), then ({a) is a sum of products of values of of the 
zeta function with i > 2. 

In fact, the problem of expressing MZVs C{a) with a G 0^^(Sym°) in terms of values of 
the zeta function is entirely equivalent to writing particular monomial symmetric functions 
in terms of power-sum symmetric functions pi, for which there are well-known algorithms 
^E^. This includes cases like 

Cizf) = C(i,i,...,i) 

(Note i = 2 occurs in equation (3) above), treated by analytical methods in P^. For 
example, since M22 = \{P2~ Vi) iii Sym°, we have 

<(2.2)4(c(2)^-c(4))4g-g^i;;. 

(For a general proof of equation (3) by this method, see Corollary 2.3 of [T7].) 

Since y is the only Lyndon word that begins with we can write SS^ = (for 
either the lu or the * product). So we can extend C to a map ^ : ij^ — R by defining 
({y). Since 

y*y = 2y'^ + xy and y lu y = 2?/^ 

there is no way to do this consistently for both multiplications, but if we restrict our 
attention to the ^-multiplication it turns out that ({y) = 7 (Euler's constant) is a happy 
choice. If 

H{t) = l + yt + {y'^ + xy)t^ + (y^ + yxy + xy"^ + x^y)t^ H 

is the generating function for the complete symmetric functions, then the following result 
is easy to show (see 



Theorem 3.6. C(^(^)) = r(l -t). 

Now one can show (e.g., using differential equations) that 

^-^ r{i — u — v) 

Putting this together with Theorem 3.6, we have 

^ C(u.)..*V« = c(i-^«). (5) 



weSjO, ht(io)=l 



Hence C('^) ^ C(0~^(Syni°)) for any word w of height 1 (i.e., of the form x^y'^), and thus 
can be written in terms of ordinary zeta values ({n) = C{zn)- 

Remarkably, Ohno and Zagier |30j have recently proved that equation (5) is just the 
constant term of the following result. 
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Theorem 3.7. 



J2 aw)u'^'"V^'"^z'''^^^ 



-1 



l-z 



1 




H{u)H{y) 
H{a)H{(3) 



) 



where 



a = - ({u + v) + ^/{u + vy — Auvz 



[3 = - Uu + v) — \/{u + vy — Auvz 



The theorem implies that any sum of MZVs of fixed weight, length, and height, e.g.. 



is in C(0~^(Sym°)) and hence expressible in terms of C,{nys. But the theorem implies 
much more. For example, taking the limit as 2; — > 1 gives the sum theorem, and setting 
V = — M gives the Le-Murakami theorem. 

For another application of Theorem 3.6 see PU] . 

4 Derivations and an Action by Quasi- Symmetric Func- 
tions 

As mentioned in the introduction, the derivation theorem has a far-reaching generalization 
proved by Ohno [SHI- In this section we give a succinct statement of Ohno's theorem and 
some of its equivalents using the Hopf algebra structure of QSym. 

We begin by motivating the use of a Hopf algebra structure in this context. (The 
standard references on Hopf algebras are and ^7\, but the reader may find a source 
like more convenient.) Let be an algebra of operators (with composition as multipli- 
cation) acting on an algebra A. Then elements of the tensor product ® act naturally 
on products pq for p,q & A: a ® P{pq) = a{p)P{q). To say that a G is a derivation is 
to say that the action of a on a products agrees with the action oi a ^ 1 + 1 ^ a: 



A Hopf algebra structure on is essentially a "coproduct" A : ^ ® compatible 
with the multiplication in 0. We require that a{pq) = A{a){pq) for all a G 0. Elements 
a with A[a) = a^l + l^a are called primitive, so the primitives in are exactly those 
that act as derivations. The "fine print" of the definition of a (graded connected) Hopf 
algebra requires that A(q;) always contain the terms a ® 1 and 10a for a of positive 
degree, so primitive elements are those whose coproducts are as simple as possible. We 



C(m) = C(3, 2, 1) + C(2, 3, 1) + C(2, 1,3) + C(3, 1, 2) 



\w\=6, ht(w)=2, i(w)=3 



a{pq) = a{p)q + poi{q) = {a®l + \® a){pq). 



9 



can generalize the notion of derivation by allowing extra terms in the coproduct. For 
example, a set {ao = l,ai,a2, ■ ■ ■} of elements is called a set of divided powers if 

A(a„) = ^ (g) aj] 

i+j=n 

if we think of the a„ as operators, they are sometimes called a "higher derivation". Thus, 
a Hopf algebra of operators is a natural extension of the notion of a Lie algebra acting by 
derivations. 

Now (S)^, *) = QSym has a Hopf algebra structure with coproduct A defined by 

n 
j=0 

(and counit e with e{u) = for all elements u of positive degree). This extends the 
well-known Hopf algebra structure on the algebra Sym (as described in JUl), in which 
the elementary symmetric functions Cj (<-^ y*) and complete symmetric functions hi are 
divided powers, while the power sums pi (<-> Zi) are primitive. The Hopf algebra {Sj^, *, A) 
is commutative but not co commutative. Its (graded) dual is the Hopf algebra of noncom- 
mutative symmetric functions as defined in [TT] . 

Now define ■ : S)^ ® Q{x,y) Q(a;, y) by setting 1 ■ w = w for all words w, 

Zk-l = 0, Zk-x = 0, Zk-y = x^y 

for all /c > 1, and 

u ■ W1W2 = ^^(w' ■ wi){u" ■ W2) (6) 

u 

where A(u) = ® u"; the coassociativity of A insures this is well-defined. It turns 

out (Lemma 5.2 of [21]) that u ■ w just consists of those terms of u *w having the same 
y-degree as w, so it follows (from the associativity of *) that ■ is really an action, i.e., 
u- {v ■ w) = {u*v) ■ w. Also, equation (6) says the action makes Q(x, y) a QSym-module 
algebra, in the terminology of [23 . 

We note that the action of zi on Q{x,y) is just the derivation D defined in the 
introduction, since zi ■ x = and zi - y = xy. In fact, for each n > 1 we have a derivation 
Dn given by Dniw) = Zn ■ w, since the Zn are primitive in QSym. 

In terms of this action, we can now state Ohno's theorem as follows. 

Theorem 4.1. For any word w & S)^ and nonnegative integer i, 

((hi ■ w) = C{K ■ riw)). 

Recall that the /i„ are divided powers, i.e., A{hn) = Ylii+j=n^i- ® Oiily ^1 = -^i 
is primitive, in which case we recover the derivation theorem. Taking h^ = 1 gives the 
duality theorem, and with a little manipulation the sum theorem can also be obtained. 
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M. Kaneko sought to generalize the derivation theorem in another way. One can 
formulate the derivation theorem as saying that {tDt — D){w) G ker^ for all w G S)^. 
Are there derivations of higher degree for which this is still true? Kaneko defined a 
degree-n derivation 9„ of Q{x,y) by 

dn{x) = -dn{y) = x{x + vY'^y, 

and conjectured that dn{w) G ker C for all w G Sj^. Note di = tDt — D, so the conjecture 
holds for n = 1] and the case n = 2 follows easily from Theorem 4.1. 

Eventually Kaneko and K. Ihara proved the conjecture [22] by showing it equivalent 
to Theorem 4.1. One way to see this involves the action we have just defined. Extend 
the action of QSym on to an action of QSym[[t]] on S)[[t]] in the obvious way, and (as 
in the previous section) let 

H{t) = l + hit + h2t^ + ■ ■ ■ G QSym[[t]] 

be the generating function of the complete symmetric functions. If we set crt{u) = H{t) ■ u 
for M G ^, then Theorem 4.1 is equivalent to C,{at{u) — <^t{u)) = for u G io, where 
at = TatT. Now at is an automorphism of in fact a^^{u) = E{—t) ■ u, where 

E{t) = l+yt + yH + -- - e QSym[[t]] 

is the generating function of the elementary symmetric functions. Thus, Theorem 4.1 is 
equivalent to 

atat^^{u) — M G ker ( 
for all u G Then following result implies Kaneko's conjecture. 

Theorem 4.2. 



oo 



atat^ = exp 



This result can be proved by showing both sides are automorphisms of Sj[[t]] that fix 
t and X + y, and take x to x{l — ty)~^ (see |2I])- The derivations dn are related to the 
derivations Dn mentioned above as follows. Since 



the map at can also be written 



n— 1 




at = exp 

Hence Theorem 4.2 says that 



n \ — ' n \ — ' n 

yn=l / \n=l / \ n=l 
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where Dn = fDnT. Thus, the 9„ can be written in terms of the and via the 
Campbell-Hausdorff formula. For example, 

d2 = D2-D2-[D,,D,], 

and 

d3 = D3-D3- ^[D,, D2] - ^[D2, A] + lllDi, D^], D,] - ^[D,, [D,, D,]]. 

5 Cyclic Derivations 

There is an analogue of the derivation theorem involving a "cyclic derivation" C : ^ — i). 
We can define C as the composition JIC, where C : S) ^ ^ is the derivation sending 
X to and y to y ® x, and /I(a ®h) = ha. Here we regard S) ® a.s a. two-sided module 
over i3 via a(h ® c) = ah® c and (a ® h)c = a®hc. Thus, e.g., 

C{x^yxy) = Jl{x^{y ® x)xy + x^yx{y ® x)) 
= Ji{x^y ® x^y + x^yxy ® x) 
= x^yx'^y + x^yxy. 

This particular definition follows D. Voiculescu's version of the cyclic derivative |HZ]: cyclic 
derivatives were first studied by Rota, Sagan and Stein [5^ . 

In terms of the composition notation, C differs from D in that the entries are permuted 
cychcally, e.g., 

D(4, 2) = (5, 2) + (4, 3) versus C(4, 2) = (5, 2) + (3, 4). 

The following result was conjectured by myself and proved by Ohno j21j . 

Theorem 5.1. For any word w E S)^ that is not a power of y, 

aC{w)) = CirCriw)). 

As mentioned in the introduction, the proof uses partial fractions. 
The difference between C and D is most striking when applied to periodic words. For 
example. Theorem 5.1 applied to w = (x^y)" gives (in the composition notation) 

C(4,3,...,3) = C(3,3,...,3,1) + C(2,3,...,3,2). 

Theorem 5.1 also gives a very nice proof of the sum theorem. Here is the idea: Let 
u = X + ty. Then the coefficient of in xu"'~'^y is the sum of all words w E Sj^ with 
\w\ = n and i{w) = k. Now 

C(m"-1) = {n~ l)txu''-^y while tCt{u''-^) = {n - l)xu''-'^y, 

so the cyclic derivation theorem implies that C applied to the coefficient of t^^^ equals C 
applied to the coefficient of t^. That is, the sum of MZVs of fixed weight n and length k 
must be independent of k (and so must be C('^))- 

Here is another corollary of Theorem 5.1, stated in terms of the action of QSym on 
Q{x,y). 
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Theorem 5.2. For m,n>l, ({Zn ■ xy"^) = ({z^ ■ xy""). 

Just as the derivation theorem extends to Theorem 4.1, it is natural to ask if the cychc 
derivation theorem can be extended. It is easy to define cyclic derivations C„ analogous to 
the Dn of the last section: just set C„ = JlCn, where Cn{x) = and Cniy) = y ® x". One 
could then try to define cyclic derivations analogous to Kaneko's derivations dn (which 
are expressible in terms of commutators of the Dn and -D„). The difficulty appears to be 
in defining the commutator of cyclic derivations. 

6 Finite Multiple Sums and Mod p Results 

In this section we consider the finite sums 
and 



77-*^ ■ ■ ■ 77.** 
n>?ii>n2>--->nfe>l 1 k 



«1 *fe 

n>ni>n2>--->nfe>l ^1 '''^k 

where the notation is patterned after that of j^; the multiple zeta values of the previous 
sections are 

C{iu ...,ik)= lim A(^i^ i ){n), 

when the limit exists (i.e., when ii > 1). 

The sums Aj{n) and Sj{n) are related in an obvious way, e.g., 

5'(4,2,l)(^) = ^(4,2,1) ('^) + ^(6,1) (^) + ^(4,3) (^) + ^{7){n)- 

We can formalize the relation as follows. For compositions /, J, we say / refines J (denoted 
I >~ J) if J can be obtained from I by combining some of its parts. Then 

Sj{n) = YMn). (7) 

Of course S(^ni){n) = A(^m){n) for all m, n. 

It will be useful to have some additional notations for compositions. We adapt the 
notation used in previous sections for words, so for / = (ii,...,?^) the weight of / is 
\I\ = ii + ■ ■ ■ + ik, and k = £{I) is the length of I. For / = (zi, . . . ,ik), the reversed 
composition {ik, ■ ■ ■ will be denoted J: of course reversal preserves weight, length and 
refinement (i.e., I J implies I ^ J). 

Compositions of weight n are in 1-to-l correspondence with subsets of {1, 2, . . . , n — 1} 
via partial sums 

{H,i2, ■ ■ ■ ,'ik) ^ {h,^ + ^2, ■ ■ ■ ,H -\ h ik~i}, 
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and J ^ J if and only if the subset corresponding to / contains that corresponding to 
J. Complementation in the power set then gives rise to an involution I ^ I*; e.g., 
(1, 1, 2)* = (3, 1). Evidently |/*| = |/| and + £{!*) = \I\ + 1. Also, / ^ J if and only 
if I* >z J*. Finally, for two compositions I and J we write I U J for their juxtaposition. 

From we have formulas for symmetric sums of Aj (n) and Sj (n) in terms of length 
one sums S(^rn){n). (Though the proofs in ^\ are given for infinite series, they carry over 
to the finite case.) They require some notation to state. For a partition 11 = {Pi, . . . ,Pi} 
of the set {1,2,..., k}, let 

I I 
c(n) = JJ(cardP, -1)! and c(n) = (-1)^-' JJ(card - 1)!, 

s=l s=l 

and if also / = {ii, . . . ,ik) is a composition of length k, let 

I 

S{n,U,I) = YlS(p^){n), where ps = ^ij. 

S = l jdPs 

If / = (ii, . . . , Zfc) is a composition of length k, then elements a G of the symmetric 
group act on / via a ■ I = (v(i), . . . , Then Theorems 2.1 and 2.2 of [Hj give us the 

following result. 

Theorem 6.1. For all positive integers k and n and compositions I of length k, 

J2sMn)= Yl c(n)5(n,n,/) 

(TSSfe partitions of {1, . . . ,k} 

J2AMn)= Yl c(n)5(n,n,/) 

crgSj; partitions of {1, . . . ,k} 

Because of the correspondence between compositions and noncommutative words in 
we have (for any fixed n) a map p„ : ^""^ — > Q sending w E S^^ to Aj^w){n), where I{w) 
is the composition associated with w. Note that pn is the composition evoT o 0„, where 
(pn is the map defined in Section 3, T is the automorphism of QSym sending Mj to Mj, 
and ev is the function that sends ti to i. Thus, p„ : (^^,*) — R is a homomorphism. 
We can combine the homomorphisms pn into a homomorphism p that sends w E Sj^ to 
the real- valued sequence n — >• pn{w). We shall write Aj for the real- valued sequence 
n Ai{n) (and similarly for Sj), so p sends w to Ak^^). 

Now QSym has various integral bases besides the Mj. In the literature one often sees 
the fundamental quasi-symmetric functions 

Fr = YMj, 

but we will be concerned with what we call the "essential" quasi-symmetric functions 

j<i 
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In view of equation (7), the homomorphism p„ sends Ej to Si{n). 

Since QSym is a commutative Hopf algebra, its antipode 5* is an automorphism of 
QSym and S*^ = id. Now S can be given by the following explicit formulas: for proof see 
^ or [19j . 

Theorem 6.2. The antipode S o/QSym is given by 

1. SiMj) = E/,u/,u...u/.=/(-l)'M,,M,, ■ ..Mj- 

2. S{Mj) = i-iy^'^Ej. 

Part (2) of this result says that the Ej have essentially the same multiplication rules 
as the Mj: if T is the automorphism of QSym defined above, then SoT takes any identity 
among the Mj to an identity among the Ej that differs only in signs. For example, since 

M(2)M(3) = M(2,3) + M(3,2) + M(5) 

we have 

E{2)E{Z) = -E(2,3) + -£^(3,2) - -£^(5)- 

Now define an automorphism ip of Q(x, ?/) by 

ilj{x)=X + y, ^{y) = -y 

Evidently ip"^ = id, and ip{Sj^) = S^^. Thus ip defines a linear involution of = QSym 
(which is not, however, a homomorphism for the *-product). We can describe the action 
of ip on the integral bases for QSym as follows. 

Theorem 6.3. For any composition I, 

1. ij{Mj) = {-ly^^^Fj 

2. ipiEi) = -Ei* 

Proof. Suppose w = w{I) is the word in x and y corresponding to a composition /. Then 
evidently substituting y in place of any particular factor x in w corresponds to splitting a 
part of /. With this observation, part (1) is clear (there is also one factor of —1 for each 
occurrence of y in w). 

Now we prove part (2). We have 

from part (1). From Example 1 of ^H], S{Fj) = (— where S is the antipode of 
QSym. Thus 

s^j) = 5^(i/(^)+iV,-. = -J2{-lY^''^Fj. 

J*^I* Kyi* 
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Now by Mobius inversion, 

Fi = ^Mj implies = 

i^j i<j 

and so 

Apply S be both sides to get 

□ 

We consider two operators on the space of real- valued sequences. First, there is 
the partial-sum operator S, given by 



'Za(n) = a{i) 

for a G R^. Second, there is the operator V given by 

Va(n) = f''V-ira(0. 

i=0 



It is easy to show that S and V generate a dihedral group within the automorphisms of 
R^, i.e., = id and SV = VS'^ It follows that (SV)^ = id. We have the following 
result on multiple sums. 

Theorem 6.4. For any composition I, SVS*/ = —Sj*. 

Proof. We proceed by induction on |/|. The weight one case is SV5'(i) = VS^^S'(i) = 
-5(1), i.e. 

\k 



^ k [k ^k' 



fin) 



which is a classical (but often rediscovered) formula; it actually goes back to Euler |9, . For 
I = (^1,^25 • • • y'ik), it is straightforward to show that VSj{n) = f{n), where / G R^ 
is given by 

%2,...,»fc)("')> if ^1 = 1; 

S"^S'(jj_i,i2,...,i^,)(ra), otherwise. 

Now suppose the theorem has been proved for all / of weight less than n, and let 
I = {ii, . . . ,ik) have weight n. There are two cases: ii = 1, and ii > 1. In the first case, 
let (^2, • • • , ik)* = J = (ji, . . . , jr)- By the assertion of the preceding paragraph and the 
induction hypothesis, 
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But evidently /* = (ji + 1,^25 ■ ■ ■ ,jr), so the theorem holds in this case 
If ii > 1, we instead write (zi — 1, ■ ■ ■ , "^/c)* = J = {ji, ■ ■ ■ ,jr)- Then 

EVSj{n) = S(-VS-iSj.(n)) = S(-SVSj.H) = -S(-Sj(n)) = -5(i,,,,...,,„)(n). 
n n n 

But in this case /* = (1, ji, . . . , >), so the theorem holds in this well. □ 

The proof of the preceding result is essentially a formalization of the procedure in 
App. B of [36jj. (For a recent occurrence of the special case / = (1, 1, 1) as a problem, see 
|15j.) Theorem 6.4, together with part (2) of Theorem 6.3, says that the diagram 

QSym — QSym 



RN RN 

commutes. 

For the rest of this section, we discuss mod p results about Si{p — 1) and Ai{j> — 1), 
where p is a prime. (Some results of this type appear in [IHl-) For prime p, the sums 
Ajip — 1) and Si{p—1) contain no factors of p in the denominators, and can be regarded 
as elements of the field Ta/pTa. The following result about length one harmonic sums is 
well known (cf. pi, pp. 86-88). 

Theorem 6.5. S(^k){p — 1) = mod p for all prime p > k + 1. 

Because Theorem 6.1 expresses symmetric sums of Si{p — 1) and Aj{p — 1) in terms 
of length one sums, any such symmetric sum is zero mod p for p > |/| + 1. In particular, 
for I = {k, k, . . . , k) (r repetitions), we have 

Ai{p — 1) = Si{p — 1) = mod p 

for prime p > rk + 1 (cf. Theorem 1.5 of [101 )• There is the following result relating sums 
associated to / and I (cf. Lemma 3.2 of [ID])- 

Theorem 6.6. For any composition I, Aj(p — 1) = (— 1)''^'A/(j9 — 1) mod p, and similarly 
Si{p-l) = {-iy^^Sj{p-l) mod p. 

Proof. Let I = {ii, . . . ,ik). Working mod p, we have 

1 { 1 hik 



W ■■■ h 



0<bi<-<bk<p 

and similarly for Si. □ 
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An immediate consequence is that Si{p — 1) = Aj{p — 1) = mod p if I = I and 
|/| is odd. Another consequence is that S(^ij){p — 1) = y4(jj)(p — 1) = mod p when 
p > i + i + 1 and i + j is even. This is because 

S(^ij){p - 1) + S{j^i){p - 1) = mod p 

for p > 2 + j + 1 by Theorem 6.1, while S(^ij){p — 1) = Si^j^i){p — 1) mod p when i + j is 
even by Theorem 6.6. 

We have the following result relating Si and Sj* . 

Theorem 6.7. Si{p — 1) = —Si*{p — 1) mod p for all primes p. 

Proof. Let / be a sequence. From the definition of V 



-/(")-eC;:0'- 



1)7(0 



j=0 

so taking n = p — 1 gives 

SV/(p - 1) = (-ir V(P - 1) = f{v - 1) mod p. 
Now take f = Sj and apply Theorem 6.4. □ 

This result has the following corollary for the Aj, which may be compared with The- 
orem 4.4 of [17j. (We use superscripts for repetition, so (n, l'^) means the composition of 
weight n + k with k repetitions of 1.) 

Theorem 6.8. If p is a prime with p > max{A; + l,n}, then 

^(n,i'')(p - 1) = - 1) mod p. 

Proof. First note that {n, l'^)* = (1"~^, k + 1). So, combining Theorems 6.7 and 6.6, 

S(^r^,MP - 1) = -^(l«-i,fe+l)(p - 1) = (-l)"+'=+^%+i,in-l)(p - 1) mod p. (9) 

Now equate the right-hand sides of parts (1) and (2) of Theorem 6.2 and then apply pp_i 
to get 

i-if'^Siip-i)= J2 (-i)%.(p-i)---A(p-i) 

IlU-Uli=I 

for any composition /; if we set / = (1*^,77,), the hypothesis insures that all the terms on 
the right-hand side are zero mod p except the one with / = 1, giving (— 1)'^S'(„ ife)(p— 1) = 

^{i'',n){p ~ 1) mod p. Apply Theorem 6.6 to get S(^n,i''){p ^ 1) = (~l)"^(n.i'=)(p ~ 1) 
mod p. By the same argument, in-i)(p— 1) = (— l)'^+^A(fc+i in-i)(p — 1) mod p, and 
equation (9) gives the conclusion. □ 
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We can state Theorem 6.7 in algebraic language as follows. Define, for each prime p, a 
map Xp '■ -^z ~^ Z/pZ by Xp{'^) — Pp-i{w). (Here S)^ the integral version of S^^, i.e., the 
graded Z-module in Z{x,y) generated by words ending in y.) The commutative diagram 
(8) gives the following algebraic version of Theorem 6.7, which can be considered a mod 
p counterpart of the duahty theorem for MZVs. 

Theorem 6.9. As elements ofZ/pZ, Xp(w) = XplV'l'"^)) foi^ words w of S)^. 
For example, since ip{x^y^) — —x^y^ — xy^ — yxy^ — y^, we have 

^(3,1,1) (P- 1) = -^(3,1,1) (P- 1) -^(2,1,1,1) (P- 1) -^(1,2,1,1) (P- 1) -^(1,1,1,1,1) (P- 1) inod p. 

For p > 6 this reduces to 2A(3,i,i)(p - 1) = -^(2,1,1,1) (p - 1) - ^(1,2,1,1) (p - 1) mod p. 
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